112 research outputs found
MDDL: A Framework for Reinforcement Learning-based Position Allocation in Multi-Channel Feed
Nowadays, the mainstream approach in position allocation system is to utilize
a reinforcement learning model to allocate appropriate locations for items in
various channels and then mix them into the feed. There are two types of data
employed to train reinforcement learning (RL) model for position allocation,
named strategy data and random data. Strategy data is collected from the
current online model, it suffers from an imbalanced distribution of
state-action pairs, resulting in severe overestimation problems during
training. On the other hand, random data offers a more uniform distribution of
state-action pairs, but is challenging to obtain in industrial scenarios as it
could negatively impact platform revenue and user experience due to random
exploration. As the two types of data have different distributions, designing
an effective strategy to leverage both types of data to enhance the efficacy of
the RL model training has become a highly challenging problem. In this study,
we propose a framework named Multi-Distribution Data Learning (MDDL) to address
the challenge of effectively utilizing both strategy and random data for
training RL models on mixed multi-distribution data. Specifically, MDDL
incorporates a novel imitation learning signal to mitigate overestimation
problems in strategy data and maximizes the RL signal for random data to
facilitate effective learning. In our experiments, we evaluated the proposed
MDDL framework in a real-world position allocation system and demonstrated its
superior performance compared to the previous baseline. MDDL has been fully
deployed on the Meituan food delivery platform and currently serves over 300
million users.Comment: 4 pages, 2 figures, accepted by SIGIR 202
Multi-time scale stochastic production simulation under VHVDC long-term contract trading electricity quantity constraint
A two-layer multi-time scale stochastic production simulation framework is constructed to account for the long-term contract electricity quantity of ultra-high voltage direct current (UHVDC) transmission. On the upper layer, based on the characteristics of load demand and renewable energy output extracted from the historical operating data, monthly and daily production simulation models are carried out considering the seasonal characteristics of hydropower during a high-water period and low-water period to optimize the distribution of contract electric quantity sending through UHVDC transmission in the target year or month. According to the DC transmission electric quantity optimized by the daily production simulation in the upper layer, together with the forecast scenario, the lower layer of the framework provides the optimization of day-ahead scheduling and intra-day rolling dispatch in the implementation process. The day-ahead dispatch optimization makes full use of the adjustment capability of transmission and optimizes the DC transmission electric quantity correction. Its compensation is based on the result of the daily production simulation, then the correction will be returned to the upper layer to restart the optimization of the remaining UHVDC contract electric quantity of the subsequent period and its distribution plan. Combined with the day-ahead DC transmission plan, the intra-day rolling optimization is carried out to adjust the output of the unit using more accurate forecasting scenarios. The distributionally robust optimization model is used in the lower layer to convert an uncertain problem into a deterministic quadratically constrained quadratic programming (QCQP) problem according to the form of an uncertain distribution set. Then the QCQP problem is further converted into a linear programming (LP) problem by using the reformulation linearization technique (RLT). A test system with the energy composition and distribution referring to a real provincial power grid in northwest China is established for verification. The results show that the proposed method can effectively improve the economics of system operation and the accommodation of renewable energy based on ensuring security
Long-Term Organic Farming Manipulated Rhizospheric Microbiome and Bacillus Antagonism Against Pepper Blight (Phytophthora capsici)
Soil-borne diseases are often less severe in organic farms, possibly because of the recruitment of beneficial microorganisms by crops. Here, the suppressiveness of organic, integrated, and conventionally managed soils to pepper blight (Phytophthora capsici) was studied in growth chamber experiments. Disease incidence was 41.3 and 34.1% lower in the soil from an organic farming system than in either the soil from the integrated or from the conventional farming systems, respectively. Beta-diversity of rhizospheric microbial communities differed among treatments, with enrichment of Bacillus, Sporosarcina, Acidobacteria Gp5, Gp6, Gp22, and Ignavibacterium by the organic soil. Cultivation-dependent analysis indicated that 50.3% of in vitro antagonists of P. capsici isolated from the rhizosphere of healthy peppers were affiliated to Bacillus. An integration of in vitro antagonists and bacterial diversity analyses indicated that Bacillus antagonists were higher in the rhizosphere of pepper treated by the organic soil. A microbial consortium of 18 in vitro Bacillus antagonists significantly increased the suppressiveness of soil from the integrated farming system against pepper blight. Overall, the soil microbiome under the long-term organic farming system was more suppressive to pepper blight, possibly owing to Bacillus antagonism in the rhizosphere. This study provided insights into microbiome management for disease suppression under greenhouse conditions
Recommended from our members
A Novel Signal Transduction Pathway that Modulates <i>rhl</i> Quorum Sensing and Bacterial Virulence in <i>Pseudomonas aeruginosa</i>
The rhl quorum-sensing (QS) system plays critical roles in the pathogenesis of P. aeruginosa. However, the regulatory effects that occur directly upstream of the rhl QS system are poorly understood. Here, we show that deletion of gene encoding for the two-component sensor BfmS leads to the activation of its cognate response regulator BfmR, which in turn directly binds to the promoter and decreases the expression of the rhlR gene that encodes the QS regulator RhlR, causing the inhibition of the rhl QS system. In the absence of bfmS, the Acka-Pta pathway can modulate the regulatory activity of BfmR. In addition, BfmS tunes the expression of 202 genes that comprise 3.6% of the P. aeruginosa genome. We further demonstrate that deletion of bfmS causes substantially reduced virulence in lettuce leaf, reduced cytotoxicity, enhanced invasion, and reduced bacterial survival during acute mouse lung infection. Intriguingly, specific missense mutations, which occur naturally in the bfmS gene in P. aeruginosa cystic fibrosis (CF) isolates such as DK2 strains and RP73 strain, can produce BfmS variants (BfmSL181P, BfmSL181P/E376Q, and BfmSR393H) that no longer repress, but instead activate BfmR. As a result, BfmS variants, but not the wild-type BfmS, inhibit the rhl QS system. This study thus uncovers a previously unexplored signal transduction pathway, BfmS/BfmR/RhlR, for the regulation of rhl QS in P. aeruginosa. We propose that BfmRS TCS may have an important role in the regulation and evolution of P. aeruginosa virulence during chronic infection in CF lungs.</p
Potential of Core-Collapse Supernova Neutrino Detection at JUNO
JUNO is an underground neutrino observatory under construction in Jiangmen, China. It uses 20kton liquid scintillator as target, which enables it to detect supernova burst neutrinos of a large statistics for the next galactic core-collapse supernova (CCSN) and also pre-supernova neutrinos from the nearby CCSN progenitors. All flavors of supernova burst neutrinos can be detected by JUNO via several interaction channels, including inverse beta decay, elastic scattering on electron and proton, interactions on C12 nuclei, etc. This retains the possibility for JUNO to reconstruct the energy spectra of supernova burst neutrinos of all flavors. The real time monitoring systems based on FPGA and DAQ are under development in JUNO, which allow prompt alert and trigger-less data acquisition of CCSN events. The alert performances of both monitoring systems have been thoroughly studied using simulations. Moreover, once a CCSN is tagged, the system can give fast characterizations, such as directionality and light curve
Detection of the Diffuse Supernova Neutrino Background with JUNO
As an underground multi-purpose neutrino detector with 20 kton liquid scintillator, Jiangmen Underground Neutrino Observatory (JUNO) is competitive with and complementary to the water-Cherenkov detectors on the search for the diffuse supernova neutrino background (DSNB). Typical supernova models predict 2-4 events per year within the optimal observation window in the JUNO detector. The dominant background is from the neutral-current (NC) interaction of atmospheric neutrinos with 12C nuclei, which surpasses the DSNB by more than one order of magnitude. We evaluated the systematic uncertainty of NC background from the spread of a variety of data-driven models and further developed a method to determine NC background within 15\% with {\it{in}} {\it{situ}} measurements after ten years of running. Besides, the NC-like backgrounds can be effectively suppressed by the intrinsic pulse-shape discrimination (PSD) capabilities of liquid scintillators. In this talk, I will present in detail the improvements on NC background uncertainty evaluation, PSD discriminator development, and finally, the potential of DSNB sensitivity in JUNO
- …